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n  this  paper,  we  present  new  techniques  for  state  assignment  of  finite  state  machines  based 
on  state  machine  decomposition  algorithms. 


A  finite  state  machine  can  be  decomposed  into  smaller  interacting  machines  so  as  to 
optimize  area  and  performance  of  the  eventual  logic  implementation.  A  recently  proposed 
form  of  decomposition,  which  has  been  shown  to  be  superior  to  previous  decomposition 
methods,  involves  identifying  subroutines  or  factors  in  the  original  machine  and  extracting 
these  factors  to  produce  fectored  and  factoring  machines. 


Optimal  state  assignment  corresponds  to  finding  an  optimal  multiple  general 
decomposition  of  a  finite  state  machine.  We  present  state  assignment  techniques  targeting 
two-level  and  multi-level  logic  implementations  based  on  factorization  algorithms  followed 
by  state  assignment  algorithms.  For  the  two-level  case,  we  prove  that  one-hot  encoding  a 
non-trivially  factored  machine  is  guaranteed  to  produce  a  better  result  than  one-hot  encoding 
the  original  machine.  Experimental  results  indicate  that  this  technique  of  factorization 
followed  by  state  assignment  is  superior  to  previous  state  assignment  techniques  for  large 
sequential  machines,  whbn  targeting  either  two-level  or  multi-level  implementations. 
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Abstract 

In  this  paper,  we  present  new  techniques  for  state  as¬ 
signment  of  finite  state  machines  based  on  state  machine 
decomposition  algorithms. 

A  finite  state  machine  can  be  decomposed  into  smaller 
interacting  machines  so  as  to  optimize  area  and  perfor¬ 
mance  of  the  eventual  logic  implementation.  A  recently 
proposed  form  of  decomposition,  which  has  been  shown 
to  be  superior  to  previous  decomposition  methods,  in¬ 
volves  identifying  subrouiines  or  factors  in  the  original 
m2M:hine  and  extracting  these  factors  to  produce  factored 
and  factoring  machines. 

Optimal  state  assignment  corresponds  to  finding  an 
optimal  multiple  general  decomposition  of  a  finite  state 
machine.  We  present  state  assignment  techniques  tar¬ 
geting  two-level  and  multi-level  logic  implementations 
based  on  factorization  algorithms  followed  by  state  as¬ 
signment  algorithms.  For  the  two-level  case,  we  prove 
that  one-hot  encoding  a  non-trtvially  factored  machine 
is  guaranteed  to  produce  a  better  result  than  one-hot  en¬ 
coding  the  original  machine.  Experimental  results  in¬ 
dicate  that  this  technique  of  factorization  followed  by 
state  assignment  is  superior  to  previous  state  assign¬ 
ment  techniques  for  large  sequential  machines,  when 
targeting  either  two-level  or  multi-level  implementa¬ 
tions. 

1  Introduction 

The  problem  of  optimal  state  assignment  involves  find¬ 
ing  a  binary  encoding  of  internal  states  in  a  finite  state 
machine  so  as  to  produce  a  minimum  area  lo^ic  imple¬ 
mentation  after  encoding  and  logic  optimization.  Sev¬ 
eral  techniques  for  state  assignment  have  been  proposed 
in  the  past;  techniques  targeting  two-level  or  sum-of- 
product  implementations  (e.g.  [4],  [7],  [6])  and  more 
recently,  techniques  targeting  multi-level  combinational 
logic  implementations  [2]  [9].  While  the  techniques  tar¬ 
geting  two-level  implementations  attempt  to  minimize 
the  number  of  product  terms  in  the  eventual  imple¬ 
mentation,  the  techniques  targeting  multi-level  logic  at¬ 
tempt  to  minimize  the  number  of  literals  in  the  filial 
implementation. 

In  [4],  the  use  of  multiple- valued  minimization  to  find 
an  optimal  input  (present  state)  encoding  and  therefore 
good  state  assignment  was  introduced  and  results  bet¬ 
ter  than  previously  proposed  techniques  were  presented. 
However,  one  failing  of  this  strategy  is  that  it  ignores 
the  next  state  spsu:e/field  of  the  finite  state  machine 
completely,  sometimes  resulting  in  a  sub-optimal  state 


assignment.  The  problem  was  partially  alleviated  in 
(7],  via  the  use  of  heuristic  output  (next  state)  encoding 
algorithms,  but  the  problem  of  effective  integration  be¬ 
tween  the  input  and  output  encoding  steps  failed  to  be 
solved.  An  encoding  that  satisfies  the  constraints  gen¬ 
erated  by  both  the  input  and  output  encoding  steps  is 
not  guaranteed  to  be  found.  Similzu  problems  afflict  the 
early  techniques  of  state  assimment  that  target  multi¬ 
level  logic  implementations  [1^. 

It  is  often  convenient  to  realize  a  sequential  circuit  as 
an  interconnection  of  two  or  more  .subcircuits  for  area 
and  performance  reasons.  Heuristics  for  state  assign¬ 
ment  and  techniques  for  logic  optimization  in  sequen¬ 
tial  circuit  synthesis  work  better  for  small  circuits  as 
opposed  to  large  ones.  If  a  good  decomposition  can  be 
found,  generally  speaking,  smaller  areas  for  decomposed 
circuits  over  a  single  lumped  circuit  will  result.  The 
decomposition  may  be  attractive  from  a  performance 
point  of  view  as  well.  The  decomposed  circuits  can  be 
clocked  faster  than  the  original  machine  due  to  smaller 
critical  path  delays.  Of  course,  a  truly  optimum  state 
assignment  algorithm  followed  by  an  optimum  logic  op¬ 
timization  step  will  not  work  better  for  a  decomposed 
circuit  as  opposed  to  a  lumped  circuit. 

Decomposition  methods  can  be  classified  into  three 
main  categories  —  parallel,  cascade  and  general  de¬ 
compositions,  corresponding  to  no  interaction,  uni¬ 
directional  interaction,  and  bi-dir.ee tional  interaction  be¬ 
tween  the  decomposed  submachines. 

The  decomposition  of  sequential  machines  was  first 
treated  in  a  formal  way  by  Hartmanis  [5]  in  1960.  The 
work  was  expanded  in  [6]  and  by  others.  Unfortunately, 
cascade  decomposition  has  limited  use  in  the  design  of 
modern  finite  state  machines.  For  example,  specifica¬ 
tions  of  centralized  controllers  in  microprocessor  chips 
do  not  usually  have  good  cascade  decompositions.  In  [3], 
factoring  algorithms  for  findi^  general  decompositions 
of  machines  were  proposed.  These  algorithms  identify 
sets  of  states  in  the  original  machine  with  similar  in¬ 
ternal  relationships  and  represent  these  sets  of  states 
by  a  call  to  a  factoring  submachine  to  produce  a  fac¬ 
tored/decomposed  submachine.  For  large  machines,  the 
areas  of  the  decomposed  realizations  were  smaller  than 
the  lumped  circuits. 

If  factorizing  prior  to  encoding  and  optimizing  a 
given  machine  produces  a  better  result  than  encoding 
and  optimization  without  decomposition,  it  means  that 
the  heuristic  encoding  techniques  used  produced  a  sub- 
optimal  result  in  the  latter  case.  In  fact,  given  the  de¬ 
composed  machines,  an  encoding  can  be  constructed  for 
the  original  machine  which  produces  a  result  at  least  of 
the  same  quality  as  the  former  case.  The  problem  of 
optimal  state  assignment  corresponds  to  finding  an  op¬ 
timal  multiple  general  decomposition  of  a  machine  (by 
multiple  decomposition  we  mean  decomposition  in  sev¬ 
eral  components  rather  than  just  two). 

In  this  paper,  we  present  new  techniques  for  state 
assignment  of  finite  state  machines,  targeting  both  two- 


level  and  multi-level  logic  implementations,  based  on 
state  machine  factorization  followed  by  the  state  as¬ 
signment  techniques  of  [4]  amd  [2].  If  the  machine  can 
be  non-trivially  factored,  we  prove  that  a  better  state 
assignment  than  one-hot  encoding  is  guaranteed  using 
these  techniques  in  the  two-level  case.  This  means  that 
factorizing  a  machine  prior  to  using  a  KISS-style  [4] 
algorithm  has  a  upper  bound  on  the  number  of  result¬ 
ing  product  terms  that  is  lower  than  the  upper  bound 
when  using  a  KISS-style  algorithm  on  the  original  ma¬ 
chine.  Since  this  upper  bound  when  using  KISS-style 
algorithms  is  a  tight  one,  lowering  it  almost  always  im¬ 
proves  the  results.  While  we  can  only  prove  a  weaker 
result  for  the  more  complicated  muUi-ltvel  case,  experi¬ 
mental  results  indicate  that  that  decomposition  followed 
by  state  assignment  produces  superior  results  for  large 
sequential  machines,  in  this  case  as  well.  The  initial 
factorization  results  in  effective  exploitation  of  the  re¬ 
lationships  between  the  input  (present  state)  space  and 
output  (next  state)  space  in  the  decomposed  subma¬ 
chines  for  both  the  twp-level  and  multi-level  cases. 

Basic  definitions  and  notations  used  are  given  in  Sec¬ 
tion  2.  Notions  of  exact  and  ideal  factors  are  presented. 
The  global  strategy  for  state  assignment  is  presented 
along  with  an  illustrative  example  in  Section  3.  Given 
this  strategy,  we  prove  theorems  that  relate  the  number 
of  product  terms  and  literals  in  an  ideally-factored  one- 
hot  encoded  machine  to  a  lumped  one-hot  encoded  ma¬ 
chine  after  optimization.  A  procedure  to  find  ideal  fac¬ 
tors,  given  a  State  Transition  Table  specification,  is  pre¬ 
sented  in  Section  4.  The  procedure  is  modified  to  find 
near-ideal  factors  in  Section  5.  Factorization  techniques 
tailored  to  (eventual)  two-level  and  multi-level  imple¬ 
mentations  are  described  in  Section  6.  Experimental 
results  obtained  on  benchmark  examples  are  presented 
in  Section  7. 

2  Preliminaries 

For  an  example  factorization  of  a  machine,  the  reader 
is  referred  to  [3].  Any  Nr  disjoint  sets  of  Nf  states  can 

be  extracted.as  occurrences, of  a  factor  and  the.decom; 
position  performed  to  obtain  to  submachines  Mi  and 
M'2.  The  complexity  of  the  decomposed  submachines 
is  profoundly  affected  by  the  choice  of  the  factor.  The 
transition  edges  between  the  Nr  sets  of  states  play  an 
important  role  in  determining  the  quality  of  a  factor. 
If  exactly  similar  transition  edge  relationships  exist  be¬ 
tween  these  sets  of  states,  factorization  will  result  in  the 
smallest  possible  number  of  transition  edges  in  the  de¬ 
composed  submachines.  The  flow  of  state  information 
between  the  two  machines.  Mi  and  M^,  will  be  mini¬ 
mal  and  an  economical  realization  will  result.  However, 
if  the  sets  of  states  corresponding  to  the  occurrences  of 
the  foctor  have  dissimilar  transition  edge  relationships, 
the  resulting  submachines  will  be  complex,  i.e.  with 
a  large  numner  of  transition  edges  dependent  on  state 
information  from  the  oth^r  machine. 

We  now  present  definitions  which  will  be  used  to  de¬ 
scribe  the  notions  of  exact  and  ideal  factoring  machines 
(factors).  The  object  being  defined  appears  in  bold 
type. 

A  factor  is  Nr  (>  1)  sets  of  states  and  all  fanout 
edges  from  these  sets  of  states  in  the  given  machine. 
Each  set  of  states  is  called  an  occurrence  of  the  factor 
F  and  is  denoted  Of*.  The  maximum  number  of  states 
in  any  of  the  Nr  occurrences  of  F,  is  denoted  Np  (Np  > 
2). 

A  transition  edge  in  the  occurrence  of  a  factor,  Op', 


is  an  internal  edge  if  it  fans  into  and  fans  out  of  states 
within  Op'.  An  exit  state  in  any  Op'  is  one  which  has 
no  internal  fanout  edges.  A  state  in  Op'  is  an  internal 
state  if  all  edges  from  the  state  fan  into  states  within 
Op'  alone,  i.e.  all  fanout  edges  from  the  state  are  in¬ 
ternal  edges.  An  entry  state  in  any  Op'  is  one  with 
no  internal  fanin  edges.  The  fanin  edges  into  a  Op'  are 
denoted  /in(i).  The  fanout  edges  out  of  a  Op'  are  de¬ 
noted  fout(i).  The  external  edges  outside  of  any  factor 
occurrence  are  denoted  EXT. 

Given  two  occurrences  of  a  factor,  Op^  and  Op^, 
a  8tat<^  correspondpnc**  pair  is  (7;,  72)  3  q,  £ 

Op^,  72  G  Op^.  A  factor  is  defined  to  be  exact  if  (1) 
state  correspondence  oairs  for  all  states  in  Of  ‘  to  states 
in  Of  ^  can  be  found  such  that  no  state  appears  in  more 
than  one  correspondence  pair  and  (2)  for  each  internal 
edge  in  Of*,  el,  if  el—  >  input  D  e2—  >  input  ^  <j> 
for  any  c2  in  Of^,  (el—  >  fanout,  e2—  >  fanout)  and 
(el—  >  fanin,  e2—  >  fanin)  are  state  correspondence 
pairs.  The  definition  of  an  exact  factor  can  be  extended 
for  Nr  >2. 

An  ideal  factor  with  Nr  occurrences,  each  with 
Npii)  +  Ni(i)  +  1  states,  is  an  exact  factor  with  Npii) 
entry  states,  Nj{i)  internal  states  and  a  single  exit  state. 

If  two  edges  in  a  State  Transition  Graph  can  be  rep¬ 
resented  by  the  same  product  term  in  an  encoded  and 
minimized  two-level  implementation,  these  two  edges 
are  said  to  be  mergeable  under  that  encoding. 

3  The  Global  Strategy 

In  this  section,  we  will  describe  the  global  strategy  used 
in  integrating  factorization  algorithms  with  state  assign¬ 
ment  techniques.  An  illustrative  example  will  be  pre¬ 
sented  and  some  theoretical  results  given. 

The  basic  idea  in  our  approach  is  to  identify  factoring 
states  in  a  machine,  and  rather  than  performing  a  fac¬ 
torization,  separately  encode  the  states  in  the  factored 
and  factoring  submachines.  By  s«»parate  encoding  we 
mean  that  each  submachine  is  encoded  using  a  different 
set  of  bits.  The  strategy  is  described  in  detail  below. 

1.  Given  an  original  machine  with  Ns  states.  Nr  dis¬ 
joint  sets  of  states,  each  with  cardinality  Np  are  se¬ 
lected.  These  sets  represent  occurrences  of  a  factor 
and  the  selection  of  these  sets  is  accomplished  us¬ 
ing  factorization  techniques  that  will  be  described 
in  Sections  4,  5  and  6. 

2.  We  perform  two  separate  encoding  steps  using  stan¬ 
dard  state  assignment  techniques.  The  unselected 
and  selected  states  are  encoded  separately,  using 
different  fields  of  Nn  and  M2  bits  respectively. 

3.  The  Np  states  in  each  occurrence  of  F,  Op',  are  en¬ 
coded  using  the  second  field  of  M2  >  ^ogiNp)  bits. 
Each  occurrence  is  coded  in  exactly  the  same  way 
—  corresponding  states  are  given  the  same  code. 

4.  The  unselected  states  are  encoded  using  the  first 
field  of  Ml  bits.  The  states  in  the  Nr  occurrences 
that  were  given  the  same  code  in  Step  3  are  differen¬ 
tiated  using  the  first  field.  Each  occurrence  is  given 
a  code  distinct  from  any  of  the  unselected  states. 
We  thus  require  M2  >  log(Ns  -  Nr  x  Np  -|-  Nr) 
bits. 


Figure  1:  Factorization  prior  to  Encoding 


5.  The  second  field  for  the  unselected  states  is  as  yet 
unspecified.  This  field  can  be  arbitrarily  chosen. 
However,  it  is  beneficial  to  uniformly  code  this  field 
for  ea,ch.of  the.unselected  states  with  a  code  used 
already  m  the  factor  occurrences. 

In  Figure  1,  a  machine  with  10  states  is  shown.  A 
factor  with  2  occurrences,  each  with  3  states,  namely, 
(s4,  sS,  86)  and  (s?,  sS,  s9)  is  selected.  The  factor  is 
ideal  with  a  single  entry  state  in  each  occurrence  (s4, 
s7),  a  single  internal  state  (sS,  s8)  and  a  single  exit 
state  (s6,  s9).  After  factorization,  the  unselected  and 
selected  states  are  one-hot  encoded  separately  as  shown 
in  Figure  2.  We  have,  in  our  example,  4  unselected 
states,  namely,  si,  s2,  s3  and  slO.  This  means  we 
have  6J}its  in  the  first  field  and  3  bits  in  the.  second 
field.  .The  second  field  code  fpr  the,  unselected  states 
was  chosen  to  coincide  with  the  code  assigned  to  the 

exit  states  in  this  case.  Indeed,  as  we  will  show  in  Theo¬ 
rem  3.2,  this  ensures  that  the  factorization  is  maximally 
exploited.  Of  course,  instead  of  one-hot  coding  the  two 
fields,  an  encoding  of  shorter  length  can  be  constructed. 
One  can  use  state  assignment  programs  like  KISS  [4]  and 
MUSTANG  [2]  to  perform  Steps  3  and  4,  depending  on 
whether  two-level  or  multi-level  logic  implementations 
are  targeted. 

We  now  state  and  prove  a  theorem  that  relate  the 
numbers  of  product  terms  after  one-hot  encoding  and 
two-level  optimization  in  the  original  and  factored  ma¬ 
chines. 

Lemma  3.1  .'  Two  edges  that  fan  out  to  different  next 
states  tn  a  machine  are  not  mergeable  under  a  one-hot 
encoding. 

Theorem  3.2  ;  Given  a  machine  M,  let  the  number 
of  product  terms  tn  a  one-hoi  coded  and  logic  mini¬ 
mized  iwo-level  implementation  be  Pq.  If  an  ideal  factor 
F  G  M  is  extracted,  then  the  number  of  product  terms 
obtained  by  one-hot  coding  the  factored  and  factoring 
machines  separately,  Pi,  is  related  to  Pq  by  the  follow- 
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Figure  2;  State  Assignment  After  Factorization 
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ing  inequality. 

Nr~\ 

Po  >  Pi  +  ^  (k 

«=i 

where  |em(01  number  of  product  terms  obtained 

by  one-hot  encoding  and  minimizing  the  e(i)  internal 
edges  in  each  Of'.  The  number  of  encoding  bits  used 
after  factorization  will  be  {Nn  —  1)  x  {Nf  —  1)  —  1  less 
than  for  the  original  machine. 

Proof:  We  first  show  that  id^l  factorization  and  one- 
hot  encoding  do  not  prevent  the  merging  of  edges  that 

are  mergeable  when  one-hot  coding  the  original  ma¬ 
chine.  Due  to  factorization,  the  edges  in  the  different 
occurrences  of  the  factor  cannot  merge.  For  example, 
ei  €  e(0  cannot  merge  with  e^  6  e(;)  if  i  ^  j.  But 
Cl  and  C2  cannot  merge  in  the  one-hot  coded  original 
machine  as  well  because  they  fan  out  to  different  next 
states. 

Also,  edges  in  the  factored  and  factoring  machines 
cannot  merge.  It  is  conceivable  that  a  Cy  6  />n(:)  could 
merge  with  e,  €  e(i)  in  the  original  machine.  However, 
since  F  is  ideal  and  only  has  entry,  internal  and  exit 
states,  it  means  that  no  c,  G  e(i)  fans  into  the  entry 
states  and  Cy  G  fin(i)  can  only  fan  into  some  entry 
state.  Therefore,  these  edges  could  not  have  merged  in 
the  original  machine  either. 

Similarly,  some  G  fout{i)  could  merge  with  Cu,  € 
EXT  in  the  original  machine,  if  they  fan  into  the  same 
state.  The  states  outside  F  are  given  second  field  codes 
corresponding  the  code  of  the  exit  state  in  each  Of' 
(Step  5  above).  Thus,  the  exit  state’s  complete  code 
differs  only  in  the  first  field  from  the  codes  of  states 
outside  F.  This  means  that  fanout  edges  from  the  exit 
state  fout(i)  can  merge  with  external  edges  EXT  in  the 
factorized  machine  also  (by  merging  the  first  field  in  the 
present  state,  since  they  fan  into  the  same  next  state). 

We  now  show  that  the  internal  edges  e(i)  G  Of' 
can  be  coalesced,  unlike  in  the  original  machine.  For 
the  original  machine  we  will  have  |cm(*)|  product 
terms  corresponding  to  these  edges,  after  one-hot  cod¬ 
ing  and  optimization.  The  state  field  is  split  in  two 
parts  in  the  factorized  machine,  /nj  and  /nj.  In  the 
worst  case,  in  the  next  state  logic,  these  two  fields  are 
realized  separately  with  no  sharing  whatsoever.  It  is 
easy  to  see  that  |cm(»)l  product  terms  suffice  to  realize 
fn2,  sin'  e  the  e(i)  are  identical  and  the  fanout  states 


Figure  3:  Smallest  possible  Ideal  Factor 


have  been  coded  the  same  in  /nj.  The  e„(i)  realize  the 
primary  outputs  as  well.  The  /nj  field  can  be  realized 
even  more  simply.  For  each  Op’  we  have  e(i)  edges  with 
the  same  next  state  field  in  frii,  corresponding  to  the 
code  assigned  to  the  Of*  (Step  4  above).  Thus,  these 
edges  can  be  merged  into  a  sing/e  product  term  with  a 
don’t  care  primary  input  vector.  The  states  in  the  prod¬ 
uct  term  correspond  to  all  the  entry  and  internal  states 
in  the  Of*.  Therefore,  the  difference  in  the  number  of 
resulting  product  terms  in  the  factorized  machine  is  at 
least 

Nn  Nk-I 

•=1  »=1 

less  than  the  original  machine. 

The  number  of  encoding  bits  used  in  the  original 
machine  is  Ns-  In  the  factorized  machine,  we  need 
Ns  —  Nr  X  Nr  -f  Nr  bits  for  the  first  field  and  Nr 
for  the  second.  The  reduction  in  bits  is  therefore 
(Nr-1)x{Nr-1)-1.  Q.E.D. 

State  assignment  techniques  do  not  perform  signif¬ 
icantly  better  than  one-hot  encoding  in  terms  of  the 
number  of  product  terms  in  the  final  PLA.  KISS  guar¬ 
antees  that  its  result  is  at,  least  as  small,  as  a  one-hot 
coded  result,  and  in  general  produces  results  with  prod¬ 
uct  terms  slightly  better  than  a  one-hot  coded  imple¬ 
mentation.  NOVA  [8],  a  variation  on  KISS,  produces 
implementations  with  generally  greater  product  terms 
than  KISS  or  one-hot  encoding,  but  saves  on  the  num¬ 
ber  of  encoding  bits  used. 

Thus,  Theorem  3.1  is  quite  a  strong  result,  since  it 
provides  a  deterministic  estimate  of  the  number  of  prod¬ 
uct  terms  in  the  final  implementation.  The  upper  bound 
on  the  number  of  product  terms  resulting  from  encod¬ 
ing  an  ideally  factorized  machine  using  a  KiSS-style  al¬ 
gorithm  is  always  lower  than  using  the  KISS-style  al¬ 
gorithm  on  the  original  machine.  This  upper  bound 
is  found  using  multiple-valued  minimization  [4]  and  as 
mentioned  earlier,  is  a  tight  bound  -  results  signifi¬ 
cantly  better  than  this  are  seldom  obtained.  The  larger 
the  ideal  factor  (in  terms  of  number  of  states  or  number 
of  occurrences),  the  greater  will  be  the  gains  in  using  the 
strategy  described.  Even  extracting  small  ideal  factors 
will  produce  better  results.  The  smallest  possible  ideal 
factor  is  one  with  2  states  and  2  occurrences, shown  in 
Figure  3.  The  factor  has  one  entry  state  and  one  exit 

state.  It  is  highly  probable  that  at  least  one  of  these 
factors  wiU  exist  in  a  large  machine. 

The  strategy  presented  can  be  generalized  to  multiple 
factorization.  Several  disjoint  ideal  factors  may  exist  in 


a  machine,  that  can  be  simultaneously  factored  to  pro¬ 
duce  cumulative  gains  in  the  number  of  product  terms. 
If  the  factors  are  not  disjoint,  then  choices  have  to  be 
made  as  to  what  factors  to  extract  at  the  expense  of 
losing  others. 

Theorem  3.3  ;  If  N  ideal  factors,  F\,  F2,  ..  F^,  are 
extracted  which  contain  disjoint  sets  of  states,  the  total 
gain  in  the  number  of  product  terms  when  performing 
a  one-hot  coding  after  factorization  over  one-hot  coding 
the  original  machine  is 

N 

G  =  ES; 

>=1 

where  gj  is  the  gain  corresponding  to  extracting  F- 
alone. 

Proof:  We  now  have  V  -(-  1  fields  that  are  coded  sep¬ 
arately,  corresponding  to  the  N  sets  of  states  in  each 
factor  and  the  V-fl-tn  corresponding  to  the  unselected 
states.  By  the  airguments  of  Theorem  3.2,  none  of  the 
fin(j,  i)  can  merge  with  e(_j,  i)  in  the  original  or  fac¬ 
tored  machines  for  any  Fj.  The  one-hot  coding  on  the 
factored  machine  is  performed  as  follows.  Every  occur¬ 
rence  Ofy*  has  a  distinct  N  -I-  1-th  field.  Each  of  the 
states  (including  the  exit  state)  in  the  Fj  and  the  un¬ 
selected  states  are  given  a  ib-th  field  code  equal  to  the 
exit  state’s  code  of  Ft.  Any  fout{j,  i)  fans  out  of  an 

exit  sta,te  and  the  code  of  this  state  differs  from,  the  un- 
seTected  states  in  the  yV  -t-  1-th  field  alone,  implymg  that 

fout{j,  i)  and  EXT  are  mergeable  in  the  factoriz^  ma¬ 
chine,  if  they  were  mergeable  in  the  original  machine. 
Similarly,  the  codes  of  the  exit  states  in  each  occurrence 
of  the  factor  also  differ  only  in  the  N  ■+  1-th  field  alone, 
and  hence  if  fout{j,  i)  and  fout{k,  1)  could  merge  in  the 
original  machine,  they  are  mergeable  in  the  factorized 

gain  due  to  extracting  each  factor  Fj  is  due  to 
the  merging  of  the  e(j,  i),  which  are  all  disjoint  and 
there  could  have  been  no  merging  across  the  e(j,  i)  in 
the  original  machine.  Therefore  the  cumulative  gain  is 

N 

G  = 

;=i 

Q.E.D. 

So  far,  we  have  dealt  with  the  two-level  case.  The 
problem  of  state  assignment  for  the  multi-level  case 
corresponds  to  finding  an  encoding  that  minimizes  the 
number  of  literals  in  the  encoded  and  optimized  ma¬ 
chine.  While  we  cannot  deterministically  predict  the  ef¬ 
fects  of  multi-level  logic  optimization,  we  can  relate  the 
number  of  literals  in  the  original  and  factored  machines 
after  one-hot  encoding  and  two-level  logic  minimization, 
but  prior  to  multi-level  optimization. 

Theorem  3.4  .•  Given  a  machine  M,  let  the  number  of 
literals  in  a  one-hot  coded  and  logic  minimized  two-level 
implementation  be  Lq.  If  an  ideal  factor  F  €  M  is  ex¬ 
tracted,  then  the  number  of  literals  obtained  by  one-hot 
coding  the  factored  and  factoring  machines  separately, 
L\,  is  related  to  Lq  by  the  following  inequality. 

Nn-l 

io  >  II  +  E  ^^^(«m(»))  - 

i=l 


Nr  X  |em(ArR)|  -  Nr  x  {Nr  -  1)  -  \EXTm\ 

where  LIT{em{i))  corresponds  to  the  number  of  literals 
obtained  by  one  hot  encoding  and  minimizing  the  e{i) 
internal  edges  in  each  Op'  and  EXTm  corresponds  to 
the  number  of  product  terms  after  one-hot  coding  and 
minimizing  the  external  edges  EXT. 

Proof:  The  arguments  of  Theorem  3.2  corresponding 
to  the  mer^g  of  edges  hold.  However,  the  number  of 
literals  in  tne  external  edges  EXT  in  the  factored  ma¬ 
chine  is  higher  because  the  number  of  literals  in  each 
state  is  2,  rather  than  1.  If  n  external  edges  merge, 
we  have  n  1  literals  in  the  present  state  space  of  the 
factored  machine,  as  opposed  to  n  in  the  original  ma¬ 
chine  (It  is  n  -t-  1  rather  than  2n  because  the  merging 
of  edges  implies  one  of  the  state  fields  in  each  of  the 
edges  is  identical).  Therefore,  the  number  of  extra  lit¬ 
erals  is  \EXTm\  (In  the  worst  case  of  EXT  not  merging 
with  any  e(»)).  The  reduction  in  the  number  of  liter¬ 
als  is  due  to  the  merging  of  the  e(i).  The  number  of 
literals  in  these  edges  in  the  original  machine  is  simply 
LIT(em(i)).  In  the  factored  machine,  we  have 
only  one  set  of  these  e(t)  edges.  The  number  of  literals 
in  this  set  is  not  the  same,  however,  as  in  the  original 
machine  sets,  since  the  number  of  literals  in  the  present 
state  space  is  higher  (the  number  of  literals  in  each  state 
is  2,  rather  than  1).  Consider  the  first  next  state  field, 
/nj.  The  first  present  state  field  will  have  Nfi  liter¬ 
als  in  each  product  term  that  realizes  /nj.  The  second 
present  state  field  will  have  the  same  number  of  liter¬ 
als  as  in  any  em(«)  in  the  lumped  field  of  the  original 
machine.  Now  consider  the  second  next  state  field.  Nr 
roduct  terms  suffice  to  realize  this  field.  The  num- 
er  of  literals  in  each  of  these  product  terms  is  simply 
Np-l,  corresponding  to  the  entry  and  internal  states  in 
F,  since  the  primary  input  field  is  a  don’t  care.  There¬ 
fore,  the  total  number  of  literals  in  the  factored  machine 
is  LinemiNR))  -I-  NrX  \em{NR)\  NrX  (Nr  -  1). 
The  reduction  in  literals  is  then 
Nf,-l 

Y,  LITMi))  -  Nr  X  |e„,(N«)l 
<=i 

-  Nrx(Nr-I)  -  \EXT„\ 

Q.E.D. 

4  Ideal  Factorization 

In  this  section,  we  will  present  an  algorithm  for  finding 
all  ideal  factors  given  a  State  Transition  Graph  descrip¬ 
tion  of  a  machine.  The  techniques  of  [3]  that  ident^y 
exact  factors  cannot  be  used,  since  they  assumed  the  ex¬ 
istence  of  a  starting  state  in  each  occurrence  from  which 
all  other  states  m  the  occurrence  could  be  re^hed-  We 
do  not  make  this  assumption  here  -  an  idesu  factor 
may  have  multiple  entry  states  and  therefore  no  start- 
ingstate. 

These  ideal  factors  are  extracted  prior  to  state  as¬ 
signment  in  order  to  improve  the  performance  of  state 
assignment  algorithms.  The  factors  extracted  may  over¬ 
lap.  Thus,  not  all  factors  can  be  extracted  —  extracting 
one  factor  may  invalidate  the  other.  In  Sections  5  and 
6,  we  discuss  techniques  for  choosing  the  appropriate 
factor(s)  to  extract  in  order  to  maximize  the  reduction 
in  logic  complexity  after  encoding  and  optimization. 

The  proc^ure  starts  with  all  possible  exit  state  sets 
and  traces  the  fanin  of  the  states  so  as  to  identify  ideal 
factors. 


1.  First,  all  sets  of  states  of  cardinality  equal  to  Nr 
whose  fanin  edges  assert  the  same  outputs,  if  driven 
by  the  same  input  combination,  regardless  of  what 
states  they  fan  out  of,  are  found.  These  sets  are 
stored  in  Trj. 

2.  All  sets  of  states  of  cardinality  equal  to  Nr  whose 
fanout  edges  assert  the  same  outputs,  if  driven 
by  the  same  input  combination,  regardless  of  what 
states  they  fan  into,  are  found.  These  sets  are 
stored  in  Tro- 

3.  A  (new)  set  of  states  Sr  €  Trj  is  picked. 

4.  An  attempt  is  made  to  construct  an  ideal  factor(s) 
whose  exit  state  set  corresponds  to  Sr-  This  is 
done  by  tracing  the  fanin  of  Sr.  The  set  of  states 
that  fan  into  each  qt  £  Sr  ,  1  <  »  <  Nr  are  found, 
namely,  fanin(qi). 

5.  A  check  is  made  to  see  if  the  fanin{qi)  are  in  direct 
correspondence.  Each  correspondence  set  must  be¬ 
long  to  Tro  and  the  edge  relationships  between  qi 
and  fanin{qi)  and  within  the  fanin(qi)  must  be 
identical  for  all  t.  If  the  check  fails,  go  to  Step  3. 

6.  F  =  qi  \J  fanin(qi)  may  be  an  ideal  factor,  with 
qi  representing  the  exit  state  of  occurrence  Or‘  and 
fanin(qi)  the  entry  states.  If  so,  F  is  recorded  as 
such. 

7.  The  fanin  of  the  famn{qi)  are  traced  back.  Given 
a  state  s^  6  /an»n(g,),  if  the  /anm(s<j)  are  not  in 
direct  correspondence,  then  s,;  has  to  be  an  entry 
state.  If  s,j  is  not  an  entry  state  (receives  edges 
from  some  s**  6  faniniqi)  or  from  qi),  go  to  3. 

8.  For  all  states  Sij,  that  do  not  have  to  be  entry 
states,  fanin{sij)  are  found.  A  check  is  made  to 
see  if  Sij  can  be  an  entry  state  for  some  ideal  factor. 
Choices  corresponding  to  treating  each  of  these  Sj; 
as  an  internal  or  entry  state  are  exhaustively  ex¬ 
plored.  If  the  Sij  is  chosen  to  be  an  entry  state,  it 
is  merely  added  to  the  current  F.  Else  if  Sij  is  in¬ 
ternal,  fanin{sij)  is  also  added  to  F.  The  various 
F’s  are  checked  for  ideality  and  the  ideal  factors 
are  recorded. 

9.  Steps  7-8  are  repeated  for  the  fanins  of  the 
fanin{sij)  and  so  on. 

The  function  fanin{arg)  above  returns  aU  states  such 
that  an  edge  from  arg  to  each  of  these  states  exists. 

5  Identifying  Near-Ideal  Factors 

While  the  number  of  ideal  factors  in  a  machine  is  typi¬ 
cally  small  and  all  ideal  factors  can  be  enumerated,  there 
may  be  large  numbers  of  near-ideal  factors.  Extracting 
these  factors  does  not  provide  the  gain  corresponding  to 
Theorem  3.2  or  Theorem  3.4,  but  could  proiduce  some 
reduction  in  the  eventual  number  of  product  terms  or 
literals.  We  therefore  have  to  solve  the  problem  of  de¬ 
tecting  non-ideal,  but  good,  factors  and  estimating  the 
gain  in  extracting  them. 

If  one  can  estimate  the  gain  of  a  non-ideal  factor, 
then  a  search  procedure  similar  to  the  one  described  in 
the  previous  section  can  be  used  to  detect  good  factors, 
i.e.  factors  with  large  associated  gains.  This  estimation 


of  cain  is  different  for  the  two-level  or  multi-level  cases 
ana  is  described  in  the  next  section.  In  this  section, 
modifications  to  the  ideal  factor  search  procedure  to  find 
near-idral  factors  are  described.  These  modifications 
are  similar  to  those  proposed  in  [3]  to  find  near-exact 
factors. 

1.  Find  similarity  weights  for  all  possible  Nn  sets  of 
states.  These  weights  are  found  on  the  basis  of  I/O 
fanout  and  fanin  relationships  between  each  set  of 
states,  i.e.  the  number  of  input  symbols  for  which 
edges  fanning  out  of  all  states  in  the  set  have  dif¬ 
ferent  outputs.  A  weight  of  zero  would  correspond 
to  exactly  similar  states. 

2.  These  sets  are  ordered  in  terms  of  increasing 
weights  (decreasing  similarity).  Beginning  from 
each  initial  exit  state  set,  the  fanins  of  the  set  are 
traced  as  in  Section  4. 

3.  The  fanin  states  from  the  initial  set  are  found  and 
added  to  the  factor  (Step  5  in  Section  4).  The 
gain  is  extracting  the  current  factor  is  estimated 
(described  in  Section  6).  If  the  gain  is  below  a 
prescribed  value,  the  search  is  terminated  and  a 
new  iteration  with  a  new  initial  set  is  begun.  Else, 
the  factor  and  its  associated  gain  are  recorded  and 
fanin  tracing  continues. 

Thus,  given  Nr,  one  can  find  non-ideal  factors,  esti¬ 
mate  tbe  gains  in  extracting  them  and  select  factors 
with  estimated  gain  greater  than  a  prescribed  value. 
This  value  is  a  function  of  the, number  of  states  in  the 
non- ideal  factor  whose  gam  is  being  estimated.  Larger 
factors,  i.e.  factors  with  more  states,  require  a  greater 
estimated  gain  in  order  to  be  recorded  and  in  order  for 

the  search  to  condnue.  This  is  done  because  of  the  es¬ 
timation  of  gain  for  non-ideal  factors  is  approximate. 

6  Implementation-Specific  Fac¬ 
torization  Techniques 

6.1  Targeting  Two-Level  Logic 

Given  a  marine,  and  all  the  ideal  faicton  in  the  ma- 
chme,  an  appropriate  set  of  factors  has  to  be  selected  so 
as  to  maximize  the  cumulative  gain  in  extracting  these 
factors.  The  factors  may  be  disjoint  or  may  overlap.  In 
the  latter  case,  extracting  one  factor  may  invalidate  the 
other.  Thus,  a  step  that  selects  the  largest  (maximum 
gain),  non  overlapping  set  of  factors  has  to  be  performed 
prior  to  state  enc^ing.  However,  since  the  number  of 
ideal  factors  is  generally  not  very  large,  this  step  can  be 
performed  optimally,  via  exhaustive  search. 

The  issue  extracting  of  non-ideal,  but  ^ood  factors  is 
important.  Since  two-level  implementations  are  quite 
constrained,  even  a  small  non- ideality  in  a  factor  can 
result  in  negligible  gain  in  the  number  of  product  terms 
when  extracting  the  fiictor.  Hence,  ideal  factors  are  al¬ 
ways  extracted  if  they  exist.  A  search  can  be  performed 
on  the  ideally-factored  machine,  using  the  procedure  de¬ 
scribed  in  the  previous  section,  for  good  factors,  or  if  no 
ideal  factors  exist,  on  the  original  machine.  The  gain  of 
non-ideal  factors  in  the  two-level  case  is  measur^  by 
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to  provide  a  relative,  rather  than  absolute  estimate,  cor¬ 
responding  to  the  possible  reduction  in  the  number  of 
product  terms.  As  before,  em(»)  corresponds  to  the 


Table  1:  State  Machine  Statistics 


number  of  product  terms,  by  one-hot  coding  and  mini¬ 
mizing  e(»)  separately.  e'(i)  corresponds  to  e(i)  G  0r‘ 
except  that  corresponding  states  in  each  Of’  are  given 
the  same  codes  (as  when  factoring). 

Near-ideal  factors  are  selectively  extracted  based  on 
their  estimated  maximum  cumulative  gain. 

6.2  Targeting  Multi-Level  Logic 

In  the  multi-level  case,  non-ideal,  but  good  factors  play 
a  more  important  part.  First,  all  ideal  factors  are 
found  and  their  gains  calculated.  However,  these  fac¬ 
tors  are  not  immediately  extracted  as  in  the  two-level 
case.  Near-ideal  factors  with  the  largest  estimated  gain 
in  literals  are  found  using  the  methods  of  Section  5.  The 
gain  of  ideal  and  non-ideal  factors  is  measured  by 

Wr 

^LIT(e„(i))  -  LIT(([Je'(i)  M 

i=l  1=1 

where  em(»)  and  e'(t)  are  the  same  as  before.  A  selection 
of  non-overlapping  ideal  and  near-ideal  factors  is  made 
so  as  to  maximize  the  overall  gain. 

7  Results 

In  this  section,  we  present  some  preliminary  results  us¬ 
ing  the  factorization  algorithms  presented  in  the  previ¬ 
ous  sections,  prior  to  state  assignment. 

The  statistics  of  benchmark  examples  from  the 
MCNC  1987  Logic  Synthesis  Workshop  and  other 
sources  are  given  in  Table  1.  The  examples  were  first 
state  minimized.  In  Table  1,  the  number  of  inputs  (inp), 
outputs  (out),  states  (sta)  and  the  minimum  number  of 
bits  (min-enc)  required  for  encoding  are  given.  In  Table 
2,  comparisons  are  drawn  against  the  state  assignment 
program  KISS.  The  number  of  encoding  bits  used  (eb) 
and  the  number  of  product  terms  (prod)  required  are 
given  for  KISS.  The  results  for  factorization  followed  by 
a  KISS-style  algorithm  are  given  under  FACTORIZE. 
The  CPU  times  required  for  factorization  and  state  as¬ 
signment  were  nominal  in  all  cases.  The  number  of  oc¬ 
currences  of  the  extracted  factor  (occ)  and  the  type  of 
factor  extracted  (typ  =  IDE  for  ideal,  NOI  for  non- 
ideal)  are  indicated  for  each  example.  As  the  results 
indicate,  ideal  (or  close  to  ideal)  factors  exist  in  large 
machines  and  extracting  them  produces  better  results. 
The  smaller  machines  in  the  benchmark  set  (not  shown) 
are  less  amenable  to  factorization  -  in  fact,  KISS  might 
indeed  be  producing  the  minimum  number  of  product 
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Table  2:  Comparisons  for  two-level  implementations 


terms  for  these  examples.  Two  exceptions  are  the  ex¬ 
amples  modl2  and  sreg;  counters  and  shift  registers 
generally  have  ideal  factors  that  can  be  extracted  to 
produce  better  results.  One  cannot  really  lose  by  using 
this  technique  of  factorization  prior  to  using  a  KISS- 
style  algorithm.  As  mentioned  in  Section  5,  when  tar¬ 
geting  two-level  implementations,  it  is  better  to  extract 
a  small  ideal  factor  rather  than  a  larger  non-ideal  one 
and  hence  for  these  examples,  if  a  single  ideal  factor  was 
found,  it  was  extracted. 

In  Table  3,  comparisons  are  drawn  against  the  state 
assignment  program  MUSTANG.  MUSTANG  imple¬ 
ments  two  different  encoding  algorithms  —  algorithms 
based  on  the  present  state  space  (MUP)  and  algorithms 
based  on  the  next  state  space  (MUNI  The  number  of 
encoding  bits  used  (eb)  and  the  number  of  literals  after 
multi-level  logic  optimization  using  MIS  [1]  are  given 
for  FAP  and  FAN  (factorization  followed  by  MUP  and 
MUN  respectively).  MUP  and  MUN  used  a  minimum 
bit  encoding  in  all  cases  and  the  literal  counts  obtained 
are  also  given  in  Table  3.  What  is  interesting  in  the 
results,  especially  for  the  large  examples,  is  that  F.\N 
or  FAP  produce  significantly  better  results  than  either 
MUP  or  MUN,  but  are  themselves  very  close.  Indeed, 
an  initial  factorization  results  in  a  better  integration 
of  the  present  state  and  next  state  coding  strategies  of 
MUSTANG.  Thus,  instead  of  running  both  algorithms 
in  MUSTANG,  only  one  has  to  be  used  to  obtain  as  good 
or  better  results.  The  factors  were  extracted  using  the 
procedure  described  in  Section  6. 

Machines  contl  and  cont2  are  contrived  examples, 
each  with  a  large  ideal  factor  in  them.  Detecting  these 
factors  prior  to  encoding  produces  much  better  results 
than  encoding  alone,  especially  in  the  two-level  case. 
These  examples  bring  out  the  deficiences  in  existing 
state  assignment  algorithms. 

8  Conclusions 

We  have  presented  state  assignment  techniques  tar¬ 
geting  two-level  and  multi-levd  logic  implementations 
based  on  state  machine  factorization  algorithms  fol¬ 
lowed  by  state  assignment  algorithms.  These  techniques 
produce  results  that  are  superior  to  previous  approaches 
to  state  assignment. 
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